Qy  421 Thr***AlaGluGlyGluAla******LeuAspLeuArgGly***HisPheGlnLeuLeu 440
        |||   |||||||||||||||      |||||||||||||||   |||||||||||||||
Db 1261 ACAGGGGCTGAAGGGGAAGCAAGGCCTCTTGATCTTAGGGGACAACATTTTCAACTTCTC 1320

Qy  441 ProPheGlySerGlyArg***MetCysProGlyVal***LeuAlaThrSerGly***Ala 460
        ||||||||||||||||||   |||||||||||||||   |||||||||||||||   |||
Db 1321 CCATTTGGGTCTGGGAGGAGAATGTGCCCTGGAGTCAATCTGGCTACTTCGGGAATGGCA 1380

Qy  461 ThrLeuLeuAlaSerLeuIleGlnCysPheAspLeuGlnValLeuGlyProGlnGlyGln 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 ACACTTCTTGCATCTCTTATTCAGTGCTTTGACTTGCAAGTGCTGGGTCCACAAGGACAG 1440

Qy  481 IleLeuLysGly***AspAlaLysValSerMetGluGluArgAlaGlyLeuThrValPro 500
        ||||||||||||   |||||||||||||||||||||||||||||||||||||||||||||
Db 1441 ATATTGAAGGGTGGTGACGCCAAAGTTAGCATGGAAGAGAGAGCCGGCCTCACTGTTCCA 1500

Qy  501 ArgAlaHisSerLeuValCysValProLeuAlaArgIleGlyValAlaSerLysLeuLeu 520
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 AGGGCACATAGTCTTGTCTGTGTTCCACTTGCAAGGATCGGCGTTGCATCTAAACTCCTT 1560



Qy 521 Ser 521 



Db 1561 TCT 1563 



RESULT 9
AF135484
LOCUS       AF135484     1722 bp    mRNA    linear   PLN 02-AUG-1999
DEFINITION  Glycine max cytochrome P450 monooxygenase CYP93C1v2p (CYP93C1v2)
            mRNA, complete cds.
ACCESSION   AF135484
VERSION     AF135484.1  GI:5059123
KEYWORDS
SOURCE      Glycine max.
  ORGANISM  Glycine max
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
            Rosidae; eurosids I; Fabales; Fabaceae; Papilionoideae; Phaseoleae;
            Glycine.
REFERENCE   1  (bases 1 to 1722)
  AUTHORS   Steele,C.L., Gijzen,M., Qutob,D. and Dixon,R.A.
  TITLE     Molecular characterization of the enzyme catalyzing the aryl
            migration reaction of isoflavonoid biosynthesis in soybean
  JOURNAL   Arch. Biochem. Biophys. 367 (1), 146-150 (1999)
  MEDLINE   99306846
   PUBMED   10375412
REFERENCE   2  (bases 1 to 1722)
  AUTHORS   Steele,C.L., Gijzen,M., Qutob,D. and Dixon,R.A.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAR-1999) Plant Biology, Noble Foundation, 2510 Sam
            Noble Pkwy, Ardmore, OK 73402, USA
FEATURES             Location/Qualifiers
     source          1..1722
                     /organism="Glycine max"
                     /db_xref="taxon:3847"
     gene            1..1722
                     /gene="CYP93C1v2"
     CDS             36..1601
                     /gene="CYP93C1v2"
                     /function="2-hydroxyisoflavanone synthase"
                     /note="The functional expression of this cDNA demonstrated
                     2-hydroxyisoflavanone synthase activity"
                     /codon_start=1
                     /product="cytochrome P450 monooxygenase CYP93C1v2p"
                     /protein_id="AAD38929.1"
                     /db_xref="GI:5059124"

/translation="MLLELALGLLVLALFLHLRPTP'f7^ ^ 

IGHllHLLI^KLJjHYALIDLSKim^ 

TRFQTSAfRR^^YDS^AIWpV^ 

R)mAQ^^A^PLD6TEEL.LKWTNSTraMMMLGEA^ 



YSWGKDI^VpE^^TQNLPYIRAIVK^ 

' ft^^mwovGra j4ywdrp S E 

GVN^ATSGI^TLLASLIQCFDLQVLGPQGQILKGGDAKVSMEERAGLTVPRAHSLVCV 



PLARIGVASKLLS" 
     misc_difference 445
                     /gene="CYP93C1v2"
                     /note="difference compared to GenBank Accession Number
                     AF022462; causes change of P to L (aa 140)"
                     /replace="c"
     misc_difference 502
                     /gene="CYP93C1v2"
                     /note="difference compared to GenBank Accession Number
                     AF022462; causes change of T to I (aa 156)"
                     /replace="c"
     misc_difference 918
                     /gene="CYP93C1v2"
                     /note="difference compared to GenBank Accession Number
                     AF022462; causes change of E to K (aa 295)"
                     /replace="g"
     misc_difference 1612..1620
                     /gene="CYP93C1v2"
                     /note="9 bp insertion compared to GenBank Accession Number
                     AF022462"
                     /replace=""
BASE COUNT      470 a    422 c    407 g    423 t
ORIGIN



Alignment Scores:
Pred. No.:              1.52e-259     Length:        1722
Score:                  2267.00       Matches:       454
Percent Similarity:     87.14%        Conservative:  0
Best Local Similarity:  87.14%        Mismatches:    67
Query Match:            94.62%        Indels:        0
DB:                     8             Gaps:          0



US-09-857-581-66 (1-521) x AF135484 (1-1722) 

Qy    1 MetLeuLeuGluLeuAlaLeuGlyLeu***ValLeuAlaLeuPhe***HisLeuArgPro 20
        |||||||||||||||||||||||||||   |||||||||||||||   ||||||||||||
Db   36 ATGTTGCTTGAACTTGCACTTGGTTTATTGGTTTTGGCTCTGTTTCTGCACTTGCGTCCC 95

Qy   21 ThrPro***Ala***SerLysAlaLeuArgHisLeuProAsnProProSerPro***Pro 40
        ||||||   |||   |||||||||||||||||||||||||||||||||||||||   |||



Db 96 ACACCC|vCt|3CA^^^CAAAAGCACT 155 

Qy   41 ArgLeuProPheIleGlyHis***HisLeuLeuLysAspLysLeuLeuHisTyrAla*** 60
        |||||||||||||||||||||   |||||||||||||||||||||||||||||||||
Db  156 CGTCTTCCCTTCATAGGACACCTTCATCTCTTAAAAGACAAACTTCTCCACTACGCACTC 215

Qy   61 IleAspLeuSerLysLysHisGlyProLeuPheSer******PheGlySerMetProThr 80
        ||||||||||||||||||||||||||||||||||||      ||||||||||||||||||
Db  216 ATCGACCTCTCCAAAAAACATGGTCCCTTATTCTCTCTGTACTTTGGCTCCATGCCAACC 275

Qy   81 ValValAlaSerThrProGluLeuPheLysLeuPheLeuGln******GluAlaThrSer 100
        ||||||||||||||||||||||||||||||||||||||||||      ||||||||||||
Db  276 GTTGTTGCCTCCACACCAGAATTGTTCAAGCTCTTCCTCCAAACGCACGAGGCAACTTCC 335

Qy  101 Phe***ThrArgPheGlnThrSerAla***Arg***LeuThrTyrAsp******ValAla 120
        |||   |||||||||||||||||||||   |||   ||||||||||||      ||||||
Db  336 TTCAACACAAGGTTCCAAACCTCAGCCATAAGACGCCTCACCTATGATAGCTCAGTGGCC 395

Qy  121 ******Pro***GlyProTyrTrp***PheValArgLysLeuIleMetAsnAspLeuLeu 140
              |||   ||||||||||||   |||||||||||||||||||||||||||||||||
Db  396 ATGGTTCCCTTCGGACCTTACTGGAAGTTCGTGAGGAAGCTCATCATGAACGACCTTCTC 455

Qy  141 AsnAlaThrThrValAsn***LeuArgProLeuArgThrGlnGlnIleArgLys***Leu 160
        ||||||||||||||||||   |||||||||||||||||||||||||||||||||   |||
Db  456 AACGCCACCACTGTAAACAAGTTGAGGCCTTTGAGGACCCAACAGATCCGCAAGTTCCTT 515

Qy  161 Arg***MetAlaGln***AlaGluAla***LysProLeuAsp***ThrGluGluLeuLeu 180
        |||   |||||||||   |||||||||   ||||||||||||   |||||||||||||||
Db  516 AGGGTTATGGCCCAAGGCGCAGAGGCACAGAAGCCCCTTGACTTGACCGAGGAGCTTCTG 575

Qy  181 LysTrp***AsnSerThr***SerMetMet***LeuGlyGluAlaGluGluIleArgAsp 200
        ||||||   |||||||||   |||||||||   |||||||||||||||||||||||||||
Db  576 AAATGGACCAACAGCACCATCTCCATGATGATGCTCGGCGAGGCTGAGGAGATCAGAGAC 635

Qy  201 IleAlaArgGluValLeuLysIle***GlyGluTyrSerLeuThrAspPheIle***Pro 220
        ||||||||||||||||||||||||   |||||||||||||||||||||||||||   |||
Db  636 ATCGCTCGCGAGGTTCTTAAGATCTTTGGCGAATACAGCCTCACTGACTTCATCTGGCCA 695

Qy  221 LeuLys***LeuLysValGlyLysTyrGluLysArgIleAspAspIleLeuAsnLysPhe 240
        ||||||   |||||||||||||||||||||||||||||||||||||||||||||||||||
Db  696 TTGAAGCATCTCAAGGTTGGAAAGTATGAGAAGAGGATCGACGACATCTTGAACAAGTTC 755

Qy  241 AspProValValGluArgValIleLysLysArgArg***IleValArgArgArg***Asn 260
        ||||||||||||||||||||||||||||||||||||   |||||||||||||||   |||
Db  756 GACCCTGTCGTTGAAAGGGTCATCAAGAAGCGCCGTGAGATCGTGAGGAGGAGAAAGAAC 815

Qy  261 GlyGlu******GluGlyGlu***SerGlyVal***LeuAspThrLeuLeuGluPheAla 280
        ||||||      |||||||||   |||||||||   ||||||||||||||||||||||||
Db  816 GGAGAGGTTGTTGAGGGTGAGGTCAGCGGGGTTTTGCTTGACACTTTGCTTGAATTCGCT 875

Qy  281 GluAspGluThr***GluIleLysIleThrLys******IleLysGlyLeuValValAsp 300
        ||||||||||||   ||||||||||||||||||      |||||||||||||||||||||
Db  876 GAGGATGAGACCATGGAGATCAAAATCACCAAGGACCACATCAAGGGTCTTGTTGTCGAC 935

Qy  301 ***PheSerAlaGly***AspSerThrAla******ThrGluTrpAlaLeuAlaGluLeu 320
           ||||||||||||   ||||||||||||      ||||||||||||||||||||||||
Db  936 TTTTTCTCGGCAGGAACAGACTCCACAGCGGTGGCAACAGAGTGGGCATTGGCAGAACTC 995

Qy  321 IleAsnAsnPro***ValLeu******AlaArgGluGlu***TyrSerValValGlyLys 340
        ||||||||||||   ||||||      ||||||||||||   ||||||||||||||||||
Db  996 ATCAACAATCCTAAGGTGTTGGAAAAGGCTCGTGAGGAGGTCTACAGTGTTGTGGGAAAG 1055

Qy  341 Asp***LeuValAspGluValAspThrGlnAsnLeuProTyrIleArgAlaIleValLys 360
        |||   ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1056 GACAGACTTGTGGACGAAGTTGACACTCAAAACCTTCCTTACATTAGAGCAATCGTGAAG 1115

Qy  361 GluThrPheArgMetHisProProLeuProValValLysArgLysCys***GluGluCys 380
        ||||||||||||||||||||||||||||||||||||||||||||||||   |||||||||
Db 1116 GAGACATTCCGCATGCACCCGCCACTCCCAGTGGTCAAAAGAAAGTGCACAGAAGAGTGT 1175

Qy  381 ***IleAsnGly***Val***ProGluGlyAlaLeu******PheAsnValTrpGlnVal 400
           |||||||||   |||   |||||||||||||||      ||||||||||||||||||
Db 1176 GAGATTAATGGATATGTGATCCCAGAGGGAGCATTGATTCTCTTCAATGTATGGCAAGTA 1235

Qy  401 Gly***Asp***LysTyrTrpAspArgProSerGlu***ArgProGluArgPheLeuGlu 420
        |||   |||   ||||||||||||||||||||||||   |||||||||||||||||||||
Db 1236 GGAAGAGACCCCAAATACTGGGACAGACCATCGGAGTTCCGTCCTGAGAGGTTCCTAGAG 1295

Qy  421 Thr***AlaGluGlyGluAla******LeuAspLeuArgGly***HisPheGlnLeuLeu 440
        |||   |||||||||||||||      |||||||||||||||   |||||||||||||||
Db 1296 ACAGGGGCTGAAGGGGAAGCAGGGCCTCTTGATCTTAGGGGACAACATTTTCAACTTCTC 1355

Qy  441 ProPheGlySerGlyArg***MetCysProGlyVal***LeuAlaThrSerGly***Ala 460
        ||||||||||||||||||   |||||||||||||||   |||||||||||||||   |||
Db 1356 CCATTTGGGTCTGGGAGGAGAATGTGCCCTGGAGTCAATCTGGCTACTTCGGGAATGGCA 1415



Qy  461 ThrLeuLeuAlaSerLeuIleGlnCysPheAspLeuGlnValLeuGlyProGlnGlyGln 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1416 ACACTTCTTGCATCTCTTATTCAGTGCTTCGACTTGCAAGTGCTGGGTCCACAAGGACAG 1475

Qy  481 IleLeuLysGly***AspAlaLysValSerMetGluGluArgAlaGlyLeuThrValPro 500
        ||||||||||||   |||||||||||||||||||||||||||||||||||||||||||||
Db 1476 ATATTGAAGGGTGGTGACGCCAAAGTTAGCATGGAAGAGAGAGCCGGCCTCACTGTTCCA 1535

Qy  501 ArgAlaHisSerLeuValCysValProLeuAlaArgIleGlyValAlaSerLysLeuLeu 520
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1536 AGGGCACATAGTCTTGTCTGTGTTCCACTTGCAAGGATCGGCGTTGCATCTAAACTCCTT 1595

Qy  521
        |||
Db 1596

RESULT 10
AF022462
LOCUS       AF022462     1824 bp    mRNA    linear   PLN 02-MAR-1999
DEFINITION  Glycine max cytochrome P450 monooxygenase CYP93C1p (CYP93C1) mRNA,
            complete cds.
ACCESSION   AF022462
VERSION     AF022462.1  GI:2739005
KEYWORDS
SOURCE      Glycine max.
  ORGANISM  Glycine max
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
            Rosidae; eurosids I; Fabales; Fabaceae; Papilionoideae; Phaseoleae;
            Glycine.
REFERENCE   1  (bases 1 to 1824)
  AUTHORS   Siminszky,B., Corbin,F.T., Ward,E.R., Fleischmann,T.J. and
            Dewey,R.E.
  TITLE     Expression of a soybean cytochrome P450 monooxygenase cDNA in yeast
            and tobacco enhances the metabolism of phenylurea herbicides
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 96 (4), 1750-1755 (1999)
  MEDLINE   99145622
   PUBMED   9990096
REFERENCE   2  (bases 1 to 1824)
  AUTHORS   Siminszky,B., Dewey,R.E. and Corbin,F.T.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-SEP-1997) Crop Science, North Carolina State
            University, Box 7620, Raleigh, NC 27695, USA
FEATURES             Location/Qualifiers
     source          1..1824
                     /organism="Glycine max"
                     /db_xref="taxon:3847"
     gene            1..1824
                     /gene="CYP93C1"
     CDS             54..1619
                     /gene="CYP93C1"
                     /note="cytochrome P450 monooxygenase"
                     /codon_start=1
                     /product="CYP93C1p"
                     /protein_id="AAB94591.1"
                     /db_xref="GI:2739006"
                     /translation="MLLELALGLLVLALFLHLRPTPTAKSKALRHLPNPPSPKPRLPF
                     IGHLHLLKDKLLHYALIDLSKKHGPLFSLYFGSMPTVVASTPELFKLFLQTHEATSFN
                     TRFQTSAIRRLTYDSSVAMVPFGPYWKFVRKLIMNDLPNATTVNKLRPLRTQQTRKFL
                     RVMAQGAEAQKPLDLTEELLKWTNSTISMMMLGEAEEIRDIAREVLKIFGEYSLTDFI
                     WPLKHLKVGKYEKRIDDILNKFDPVVERVIKKRREIVRRRKNGEVVEGEVSGVFLDTL
                     LEFAEDETMEIKITKDHIEGLVVDFFSAGTDSTAVATEWALAELINNPKVLEKAREEV
                     YSVVGKDRLVDEVDTQNLPYIRAIVKETFRMHPPLPVVKRKCTEECEINGYVIPEGAL
                     ILFNVWQVGRDPKYWDRPSEFRPERFLETGAEGEAGPLDLRGQHFQLLPFGSGRRMCP
                     GVNLATSGMATLLASLIQCFDLQVLGPQGQILKGGDAKVSMEERAGLTVPRAHSLVCV
                     PLARIGVASKLLS"



BASE COUNT      497 a    448 c    416 g    463 t
ORIGIN



Alignment Scores:
Pred. No.:              1.3e-257      Length:        1824
Score:                  2251.00       Matches:       451
Percent Similarity:     86.76%        Conservative:  1
Best Local Similarity:  86.56%        Mismatches:    69
Query Match:            93.95%        Indels:        0
DB:                     8             Gaps:          0
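
The two similarity percentages in the score tables appear to follow directly from the Matches, Conservative, Mismatches and Indels counts. The sketch below is not taken from the search program; it assumes that both percentages are computed over the total number of alignment columns, and that Percent Similarity counts conservative substitutions while Best Local Similarity counts exact matches only. The function name `similarity_stats` is hypothetical. Under that assumption it reproduces the figures reported for AF135484, AF022462 and AF195819.

```python
def similarity_stats(matches, conservative, mismatches, indels):
    """Return (percent_similarity, best_local_similarity), rounded to 2 places.

    Assumption (not stated in the report): both values use the total number
    of alignment columns as the denominator.
    """
    columns = matches + conservative + mismatches + indels
    percent_similarity = 100.0 * (matches + conservative) / columns
    best_local_similarity = 100.0 * matches / columns
    return round(percent_similarity, 2), round(best_local_similarity, 2)


# Counts copied from the "Alignment Scores" tables in this report.
af135484 = similarity_stats(454, 0, 67, 0)
af022462 = similarity_stats(451, 1, 69, 0)
af195819 = similarity_stats(451, 1, 69, 46)
print(af135484, af022462, af195819)
```

The 46 indel columns in the AF195819 alignment are what pull its percentages below those of the mRNA entries, even though the match counts are identical.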



US-09-857-581-66 (1-521) x AF022462 (1-1824) 

Qy    1 MetLeuLeuGluLeuAlaLeuGlyLeu***ValLeuAlaLeuPhe***HisLeuArgPro 20
        |||||||||||||||||||||||||||   |||||||||||||||   ||||||||||||
Db   54 ATGTTGCTTGAACTTGCACTTGGTTTATTGGTTTTGGCTCTGTTTCTGCACTTGCGTCCC 113

Qy   21 ThrPro***Ala***SerLysAlaLeuArgHisLeuProAsnProProSerPro***Pro 40
        ||||||   |||   |||||||||||||||||||||||||||||||||||||||   |||
Db  114 ACACCCACTGCAAAATCAAAAGCACTTCGCCATCTCCCAAACCCACCAAGCCCAAAGCCT 173

Qy   41 ArgLeuProPheIleGlyHis***HisLeuLeuLysAspLysLeuLeuHisTyrAla*** 60
        |||||||||||||||||||||   |||||||||||||||||||||||||||||||||
Db  174 CGTCTTCCCTTCATAGGACACCTTCATCTCTTAAAAGACAAACTTCTCCACTACGCACTC 233

Qy   61 IleAspLeuSerLysLysHisGlyProLeuPheSer******PheGlySerMetProThr 80
        ||||||||||||||||||||||||||||||||||||      ||||||||||||||||||
Db  234 ATCGACCTCTCCAAAAAACATGGTCCCTTATTCTCTCTCTACTTTGGCTCCATGCCAACC 293

Qy   81 ValValAlaSerThrProGluLeuPheLysLeuPheLeuGln******GluAlaThrSer 100
        ||||||||||||||||||||||||||||||||||||||||||      ||||||||||||
Db  294 GTTGTTGCCTCCACACCAGAATTGTTCAAGCTCTTCCTCCAAACGCACGAGGCAACTTCC 353

Qy  101 Phe***ThrArgPheGlnThrSerAla***Arg***LeuThrTyrAsp******ValAla 120
        |||   |||||||||||||||||||||   |||   ||||||||||||      ||||||
Db  354 TTCAACACAAGGTTCCAAACCTCAGCCATAAGACGCCTCACCTATGATAGCTCAGTGGCC 413

Qy  121 ******Pro***GlyProTyrTrp***PheValArgLysLeuIleMetAsnAspLeuLeu 140
              |||   ||||||||||||   ||||||||||||||||||||||||||||||
Db  414 ATGGTTCCCTTCGGACCTTACTGGAAGTTCGTGAGGAAGCTCATCATGAACGACCTTCCC 473

Qy  141 AsnAlaThrThrValAsn***LeuArgProLeuArgThrGlnGlnIleArgLys***Leu 160
        ||||||||||||||||||   ||||||||||||||||||||||||   ||||||   |||
Db  474 AACGCCACCACTGTAAACAAGTTGAGGCCTTTGAGGACCCAACAGACCCGCAAGTTCCTT 533

Qy  161 Arg***MetAlaGln***AlaGluAla***LysProLeuAsp***ThrGluGluLeuLeu 180
        |||   |||||||||   |||||||||   ||||||||||||   |||||||||||||||
Db  534 AGGGTTATGGCCCAAGGCGCAGAGGCACAGAAGCCCCTTGACTTGACCGAGGAGCTTCTG 593

Qy  181 LysTrp***AsnSerThr***SerMetMet***LeuGlyGluAlaGluGluIleArgAsp 200
        ||||||   |||||||||   |||||||||   |||||||||||||||||||||||||||
Db  594 AAATGGACCAACAGCACCATCTCCATGATGATGCTCGGCGAGGCTGAGGAGATCAGAGAC 653

Qy  201 IleAlaArgGluValLeuLysIle***GlyGluTyrSerLeuThrAspPheIle***Pro 220
        ||||||||||||||||||||||||   |||||||||||||||||||||||||||   |||
Db  654 ATCGCTCGCGAGGTTCTTAAGATCTTTGGCGAATACAGCCTCACTGACTTCATCTGGCCA 713

Qy  221 LeuLys***LeuLysValGlyLysTyrGluLysArgIleAspAspIleLeuAsnLysPhe 240
        ||||||   |||||||||||||||||||||||||||||||||||||||||||||||||||
Db  714 TTGAAGCATCTCAAGGTTGGAAAGTATGAGAAGAGGATCGACGACATCTTGAACAAGTTC 773

Qy  241 AspProValValGluArgValIleLysLysArgArg***IleValArgArgArg***Asn 260
        ||||||||||||||||||||||||||||||||||||   |||||||||||||||   |||
Db  774 GACCCTGTCGTTGAAAGGGTCATCAAGAAGCGCCGTGAGATCGTGAGGAGGAGAAAGAAC 833

Qy  261 GlyGlu******GluGlyGlu***SerGlyVal***LeuAspThrLeuLeuGluPheAla 280
        ||||||      |||||||||   |||||||||   ||||||||||||||||||||||||
Db  834 GGAGAGGTTGTTGAGGGTGAGGTCAGCGGGGTTTTCCTTGACACTTTGCTTGAATTCGCT 893

Qy  281 GluAspGluThr***GluIleLysIleThrLys******IleLysGlyLeuValValAsp 300
        ||||||||||||   ||||||||||||||||||      |||:::|||||||||||||||
Db  894 GAGGATGAGACCATGGAGATCAAAATCACCAAGGACCACATCGAGGGTCTTGTTGTCGAC 953

Qy  301 ***PheSerAlaGly***AspSerThrAla******ThrGluTrpAlaLeuAlaGluLeu 320
           ||||||||||||   ||||||||||||      ||||||||||||||||||||||||
Db  954 TTTTTCTCGGCAGGAACAGACTCCACAGCGGTGGCAACAGAGTGGGCATTGGCAGAACTC 1013

Qy  321 IleAsnAsnPro***ValLeu******AlaArgGluGlu***TyrSerValValGlyLys 340
        ||||||||||||   ||||||      ||||||||||||   ||||||||||||||||||
Db 1014 ATCAACAATCCTAAGGTGTTGGAAAAGGCTCGTGAGGAGGTCTACAGTGTTGTGGGAAAG 1073

Qy  341 Asp***LeuValAspGluValAspThrGlnAsnLeuProTyrIleArgAlaIleValLys 360
        |||   ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1074 GACAGACTTGTGGACGAAGTTGACACTCAAAACCTTCCTTACATTAGAGCAATCGTGAAG 1133

Qy  361 GluThrPheArgMetHisProProLeuProValValLysArgLysCys***GluGluCys 380
        ||||||||||||||||||||||||||||||||||||||||||||||||   |||||||||
Db 1134 GAGACATTCCGCATGCACCCGCCACTCCCAGTGGTCAAAAGAAAGTGCACAGAAGAGTGT 1193

Qy  381 ***IleAsnGly***Val***ProGluGlyAlaLeu******PheAsnValTrpGlnVal 400
           |||||||||   |||   |||||||||||||||      ||||||||||||||||||
Db 1194 GAGATTAATGGATATGTGATCCCAGAGGGAGCATTGATTCTCTTCAATGTATGGCAAGTA 1253

Qy  401 Gly***Asp***LysTyrTrpAspArgProSerGlu***ArgProGluArgPheLeuGlu 420
        |||   |||   ||||||||||||||||||||||||   |||||||||||||||||||||
Db 1254 GGAAGAGACCCCAAATACTGGGACAGACCATCGGAGTTCCGTCCTGAGAGGTTCCTAGAG 1313

Qy  421 Thr***AlaGluGlyGluAla******LeuAspLeuArgGly***HisPheGlnLeuLeu 440
        |||   |||||||||||||||      |||||||||||||||   |||||||||||||||
Db 1314 ACAGGGGCTGAAGGGGAAGCAGGGCCTCTTGATCTTAGGGGACAACATTTTCAACTTCTC 1373

Qy  441 ProPheGlySerGlyArg***MetCysProGlyVal***LeuAlaThrSerGly***Ala 460
        ||||||||||||||||||   |||||||||||||||   |||||||||||||||   |||
Db 1374 CCATTTGGGTCTGGGAGGAGAATGTGCCCTGGAGTCAATCTGGCTACTTCGGGAATGGCA 1433

Qy  461 ThrLeuLeuAlaSerLeuIleGlnCysPheAspLeuGlnValLeuGlyProGlnGlyGln 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1434 ACACTTCTTGCATCTCTTATTCAGTGCTTCGACTTGCAAGTGCTGGGTCCACAAGGACAG 1493

Qy  481 IleLeuLysGly***AspAlaLysValSerMetGluGluArgAlaGlyLeuThrValPro 500
        ||||||||||||   |||||||||||||||||||||||||||||||||||||||||||||
Db 1494 ATATTGAAGGGTGGTGACGCCAAAGTTAGCATGGAAGAGAGAGCCGGCCTCACTGTTCCA 1553

Qy  501 ArgAlaHisSerLeuValCysValProLeuAlaArgIleGlyValAlaSerLysLeuLeu 520
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1554 AGGGCACATAGTCTTGTCTGTGTTCCACTTGCAAGGATCGGCGTTGCATCTAAACTCCTT 1613

Qy  521 Ser 521
        |||
Db 1614 TCT 1616
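
Each alignment row pairs 20 query residues (three-letter codes, with `***` apparently marking positions that are unspecified in the query) against 60 database nucleotides read as 20 codons. The sketch below shows one way to check a row by hand: translate the Db line with the standard genetic code and mark the codons whose translation equals the Qy residue. It is an illustration only, not the search program's own method; the helper names `translate` and `match_line` are hypothetical, and conservative substitutions (shown as `:::` in the report) are not modelled.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys", "Q": "Gln",
    "E": "Glu", "G": "Gly", "H": "His", "I": "Ile", "L": "Leu", "K": "Lys",
    "M": "Met", "F": "Phe", "P": "Pro", "S": "Ser", "T": "Thr", "W": "Trp",
    "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate an in-frame DNA string to one-letter amino acids."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def match_line(qy, db):
    """Build the '|||' line: one mark per codon whose translation equals Qy."""
    marks = []
    for i, aa in enumerate(translate(db)):
        residue = qy[3 * i:3 * i + 3]
        marks.append("|||" if THREE_LETTER.get(aa) == residue else "   ")
    return "".join(marks)


# First row of the AF022462 alignment (Qy 1-20, Db 54-113).
qy = "MetLeuLeuGluLeuAlaLeuGlyLeu***ValLeuAlaLeuPhe***HisLeuArgPro"
db = "ATGTTGCTTGAACTTGCACTTGGTTTATTGGTTTTGGCTCTGTTTCTGCACTTGCGTCCC"
print(qy)
print(match_line(qy, db))
print(db)
```

For this row the translation is MLLELALGLLVLALFLHLRP, so every codon is marked except the two under `***`.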

RESULT 11 
AF195799
LOCUS       AF195799     1824 bp    mRNA    linear   PLN 16-FEB-2000
DEFINITION  Glycine max isoflavone synthase 2 (ifs2) mRNA, complete cds.
ACCESSION   AF195799
VERSION     AF195799.1  GI:6979521
KEYWORDS
SOURCE      Glycine max.
  ORGANISM  Glycine max
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
            Rosidae; eurosids I; Fabales; Fabaceae; Papilionoideae; Phaseoleae;
            Glycine.
REFERENCE   1  (bases 1 to 1824)
  AUTHORS   Jung,W., Yu,O., Lau,S.M., O'Keefe,D.P., Odell,J., Fader,G. and
            McGonigle,B.
  TITLE     Identification and expression of isoflavone synthase, the key
            enzyme for biosynthesis of isoflavones in legumes
  JOURNAL   Nat. Biotechnol. 18 (2), 208-212 (2000)
  MEDLINE   20124255
   PUBMED   10657130
REFERENCE   2  (bases 1 to 1824)
  AUTHORS   Jung,W., Yu,O., Odell,J., Fader,G. and McGonigle,B.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-OCT-1999) Nutrition and Health, DuPont, P.O. Box
            80402, Wilmington, DE 19880-0402, USA
FEATURES             Location/Qualifiers
     source          1..1824
                     /organism="Glycine max"
                     /db_xref="taxon:3847"
     gene            1..1824
                     /gene="ifs2"
     CDS             54..1619
                     /gene="ifs2"
                     /codon_start=1
                     /product="isoflavone synthase 2"
                     /protein_id="AAF34520.1"
                     /db_xref="GI:6979522"
                     /translation="MLLELALGLLVLALFLHLRPTPTAKSKALRHLPNPPSPKPRLPF
                     IGHLHLLKDKLLHYALIDLSKKHGPLFSLYFGSMPTVVASTPELFKLFLQTHEATSFN
                     TRFQTSAIRRLTYDSSVAMVPFGPYWKFVRKLIMNDLPNATTVNKLRPLRTQQTRKFL
                     RVMAQGAEAQKPLDLTEELLKWTNSTISMMMLGEAEEIRDIAREVLKIFGEYSLTDFI
                     WPLKHLKVGKYEKRIDDILNKFDPVVERVIKKRREIVRRRKNGEVVEGEVSGVFLDTL
                     LEFAEDETMEIKITKDHIEGLVVDFFSAGTDSTAVATEWALAELINNPKVLEKAREEV
                     YSVVGKDRLVDEVDTQNLPYIRAIVKETFRMHPPLPVVKRKCTEECEINGYVIPEGAL
                     ILFNVWQVGRDPKYWDRPSEFRPERFLETGAEGEAGPLDLRGQHFQLLPFGSGRRMCP
                     GVNLATSGMATLLASLIQCFDLQVLGPQGQILKGGDAKVSMEERAGLTVPRAHSLVCV
                     PLARIGVASKLLS"



BASE COUNT      497 a    448 c    416 g    463 t
ORIGIN



Alignment Scores:
Pred. No.:              1.3e-257      Length:        1824
Score:                  2251.00       Matches:       451
Percent Similarity:     86.76%        Conservative:  1
Best Local Similarity:  86.56%        Mismatches:    69
Query Match:            93.95%        Indels:        0
DB:                     8             Gaps:          0



US-09-857-581-66 (1-521) x AF195799 (1-1824) 

Qy    1 MetLeuLeuGluLeuAlaLeuGlyLeu***ValLeuAlaLeuPhe***HisLeuArgPro 20
        |||||||||||||||||||||||||||   |||||||||||||||   ||||||||||||
Db   54 ATGTTGCTTGAACTTGCACTTGGTTTATTGGTTTTGGCTCTGTTTCTGCACTTGCGTCCC 113

Qy   21 ThrPro***Ala***SerLysAlaLeuArgHisLeuProAsnProProSerPro***Pro 40
        ||||||   |||   |||||||||||||||||||||||||||||||||||||||   |||
Db  114 ACACCCACTGCAAAATCAAAAGCACTTCGCCATCTCCCAAACCCACCAAGCCCAAAGCCT 173

Qy   41 ArgLeuProPheIleGlyHis***HisLeuLeuLysAspLysLeuLeuHisTyrAla*** 60
        |||||||||||||||||||||   |||||||||||||||||||||||||||||||||
Db  174 CGTCTTCCCTTCATAGGACACCTTCATCTCTTAAAAGACAAACTTCTCCACTACGCACTC 233

Qy   61 IleAspLeuSerLysLysHisGlyProLeuPheSer******PheGlySerMetProThr 80
        ||||||||||||||||||||||||||||||||||||      ||||||||||||||||||
Db  234 ATCGACCTCTCCAAAAAACATGGTCCCTTATTCTCTCTCTACTTTGGCTCCATGCCAACC 293

Qy   81 ValValAlaSerThrProGluLeuPheLysLeuPheLeuGln******GluAlaThrSer 100
        ||||||||||||||||||||||||||||||||||||||||||      ||||||||||||
Db  294 GTTGTTGCCTCCACACCAGAATTGTTCAAGCTCTTCCTCCAAACGCACGAGGCAACTTCC 353

Qy  101 Phe***ThrArgPheGlnThrSerAla***Arg***LeuThrTyrAsp******ValAla 120
        |||   |||||||||||||||||||||   |||   ||||||||||||      ||||||
Db  354 TTCAACACAAGGTTCCAAACCTCAGCCATAAGACGCCTCACCTATGATAGCTCAGTGGCC 413

Qy  121 ******Pro***GlyProTyrTrp***PheValArgLysLeuIleMetAsnAspLeuLeu 140
              |||   ||||||||||||   ||||||||||||||||||||||||||||||
Db  414 ATGGTTCCCTTCGGACCTTACTGGAAGTTCGTGAGGAAGCTCATCATGAACGACCTTCCC 473

Qy  141 AsnAlaThrThrValAsn***LeuArgProLeuArgThrGlnGlnIleArgLys***Leu 160
        ||||||||||||||||||   ||||||||||||||||||||||||   ||||||   |||
Db  474 AACGCCACCACTGTAAACAAGTTGAGGCCTTTGAGGACCCAACAGACCCGCAAGTTCCTT 533

Qy  161 Arg***MetAlaGln***AlaGluAla***LysProLeuAsp***ThrGluGluLeuLeu 180
        |||   |||||||||   |||||||||   ||||||||||||   |||||||||||||||
Db  534 AGGGTTATGGCCCAAGGCGCAGAGGCACAGAAGCCCCTTGACTTGACCGAGGAGCTTCTG 593

Qy  181 LysTrp***AsnSerThr***SerMetMet***LeuGlyGluAlaGluGluIleArgAsp 200
        ||||||   |||||||||   |||||||||   |||||||||||||||||||||||||||
Db  594 AAATGGACCAACAGCACCATCTCCATGATGATGCTCGGCGAGGCTGAGGAGATCAGAGAC 653

Qy  201 IleAlaArgGluValLeuLysIle***GlyGluTyrSerLeuThrAspPheIle***Pro 220
        ||||||||||||||||||||||||   |||||||||||||||||||||||||||   |||
Db  654 ATCGCTCGCGAGGTTCTTAAGATCTTTGGCGAATACAGCCTCACTGACTTCATCTGGCCA 713

Qy  221 LeuLys***LeuLysValGlyLysTyrGluLysArgIleAspAspIleLeuAsnLysPhe 240
        ||||||   |||||||||||||||||||||||||||||||||||||||||||||||||||
Db  714 TTGAAGCATCTCAAGGTTGGAAAGTATGAGAAGAGGATCGACGACATCTTGAACAAGTTC 773

Qy  241 AspProValValGluArgValIleLysLysArgArg***IleValArgArgArg***Asn 260
        ||||||||||||||||||||||||||||||||||||   |||||||||||||||   |||
Db  774 GACCCTGTCGTTGAAAGGGTCATCAAGAAGCGCCGTGAGATCGTGAGGAGGAGAAAGAAC 833

Qy  261 GlyGlu******GluGlyGlu***SerGlyVal***LeuAspThrLeuLeuGluPheAla 280
        ||||||      |||||||||   |||||||||   ||||||||||||||||||||||||
Db  834 GGAGAGGTTGTTGAGGGTGAGGTCAGCGGGGTTTTCCTTGACACTTTGCTTGAATTCGCT 893

Qy  281 GluAspGluThr***GluIleLysIleThrLys******IleLysGlyLeuValValAsp 300
        ||||||||||||   ||||||||||||||||||      |||:::|||||||||||||||
Db  894 GAGGATGAGACCATGGAGATCAAAATCACCAAGGACCACATCGAGGGTCTTGTTGTCGAC 953

Qy  301 ***PheSerAlaGly***AspSerThrAla******ThrGluTrpAlaLeuAlaGluLeu 320
           ||||||||||||   ||||||||||||      ||||||||||||||||||||||||
Db  954 TTTTTCTCGGCAGGAACAGACTCCACAGCGGTGGCAACAGAGTGGGCATTGGCAGAACTC 1013

Qy  321 IleAsnAsnPro***ValLeu******AlaArgGluGlu***TyrSerValValGlyLys 340
        ||||||||||||   ||||||      ||||||||||||   ||||||||||||||||||
Db 1014 ATCAACAATCCTAAGGTGTTGGAAAAGGCTCGTGAGGAGGTCTACAGTGTTGTGGGAAAG 1073

Qy  341 Asp***LeuValAspGluValAspThrGlnAsnLeuProTyrIleArgAlaIleValLys 360
        |||   ||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1074 GACAGACTTGTGGACGAAGTTGACACTCAAAACCTTCCTTACATTAGAGCAATCGTGAAG 1133

Qy  361 GluThrPheArgMetHisProProLeuProValValLysArgLysCys***GluGluCys 380
        ||||||||||||||||||||||||||||||||||||||||||||||||   |||||||||
Db 1134 GAGACATTCCGCATGCACCCGCCACTCCCAGTGGTCAAAAGAAAGTGCACAGAAGAGTGT 1193

Qy  381 ***IleAsnGly***Val***ProGluGlyAlaLeu******PheAsnValTrpGlnVal 400
           |||||||||   |||   |||||||||||||||      ||||||||||||||||||
Db 1194 GAGATTAATGGATATGTGATCCCAGAGGGAGCATTGATTCTCTTCAATGTATGGCAAGTA 1253

Qy  401 Gly***Asp***LysTyrTrpAspArgProSerGlu***ArgProGluArgPheLeuGlu 420
        |||   |||   ||||||||||||||||||||||||   |||||||||||||||||||||
Db 1254 GGAAGAGACCCCAAATACTGGGACAGACCATCGGAGTTCCGTCCTGAGAGGTTCCTAGAG 1313

Qy  421 Thr***AlaGluGlyGluAla******LeuAspLeuArgGly***HisPheGlnLeuLeu 440
        |||   |||||||||||||||      |||||||||||||||   |||||||||||||||
Db 1314 ACAGGGGCTGAAGGGGAAGCAGGGCCTCTTGATCTTAGGGGACAACATTTTCAACTTCTC 1373

Qy  441 ProPheGlySerGlyArg***MetCysProGlyVal***LeuAlaThrSerGly***Ala 460
        ||||||||||||||||||   |||||||||||||||   |||||||||||||||   |||
Db 1374 CCATTTGGGTCTGGGAGGAGAATGTGCCCTGGAGTCAATCTGGCTACTTCGGGAATGGCA 1433

Qy  461 ThrLeuLeuAlaSerLeuIleGlnCysPheAspLeuGlnValLeuGlyProGlnGlyGln 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1434 ACACTTCTTGCATCTCTTATTCAGTGCTTCGACTTGCAAGTGCTGGGTCCACAAGGACAG 1493

Qy  481 IleLeuLysGly***AspAlaLysValSerMetGluGluArgAlaGlyLeuThrValPro 500
        ||||||||||||   |||||||||||||||||||||||||||||||||||||||||||||
Db 1494 ATATTGAAGGGTGGTGACGCCAAAGTTAGCATGGAAGAGAGAGCCGGCCTCACTGTTCCA 1553

Qy  501 ArgAlaHisSerLeuValCysValProLeuAlaArgIleGlyValAlaSerLysLeuLeu 520
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1554 AGGGCACATAGTCTTGTCTGTGTTCCACTTGCAAGGATCGGCGTTGCATCTAAACTCCTT 1613

Qy  521 Ser 521
        |||
Db 1614 TCT 1616



RESULT 12 

AF195819
LOCUS       AF195819     1902 bp    DNA     linear   PLN 23-MAR-2000
DEFINITION  Glycine max isoflavone synthase 2 (ifs2) gene, complete cds.
ACCESSION   AF195819
VERSION     AF195819.1  GI:7288454
KEYWORDS
SOURCE      Glycine max.
  ORGANISM  Glycine max
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
            Rosidae; eurosids I; Fabales; Fabaceae; Papilionoideae; Phaseoleae;
            Glycine.
REFERENCE   1  (bases 1 to 1902)
  AUTHORS   Jung,W., Yu,O., Lau,S.M., O'Keefe,D.P., Odell,J., Fader,G. and
            McGonigle,B.
  TITLE     Identification and expression of isoflavone synthase, the key
            enzyme for biosynthesis of isoflavones in legumes
  JOURNAL   Nat. Biotechnol. 18 (2), 208-212 (2000)
  MEDLINE   20124255
   PUBMED   10657130
REFERENCE   2  (bases 1 to 1902)
  AUTHORS   Jung,W., Yu,O., Odell,J., Fader,G. and McGonigle,B.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-OCT-1999) Nutrition and Health, DuPont, PO Box 80402,
            Wilmington, DE 19880-0402, USA
FEATURES             Location/Qualifiers
     source          1..1902
                     /organism="Glycine max"
                     /db_xref="taxon:3847"
     gene            1..1902
                     /gene="ifs2"
     mRNA            join(1..948,1085..1902)
                     /gene="ifs2"
                     /product="isoflavone synthase 2"
     CDS             join(52..948,1085..1753)
                     /gene="ifs2"
                     /note="cytochrome P450"
                     /codon_start=1
                     /product="isoflavone synthase 2"
                     /protein_id="AAF45143.1"
                     /db_xref="GI:7288455"
                     /translation="MLLELALGLLVLALFLHLRPTPTAKSKALRHLPNPPSPKPRLPF
                     IGHLHLLKDKLLHYALIDLSKKHGPLFSLYFGSMPTVVASTPELFKLFLQTHEATSFN
                     TRFQTSAIRRLTYDSSVAMVPFGPYWKFVRKLIMNDLPNATTVNKLRPLRTQQTRKFL
                     RVMAQGAEAQKPLDLTEELLKWTNSTISMMMLGEAEEIRDIAREVLKIFGEYSLTDFI
                     WPLKHLKVGKYEKRIDDILNKFDPVVERVIKKRREIVRRRKNGEVVEGEVSGVFLDTL
                     LEFAEDETMEIKITKDHIEGLVVDFFSAGTDSTAVATEWALAELINNPKVLEKAREEV
                     YSVVGKDRLVDEVDTQNLPYIRAIVKETFRMHPPLPVVKRKCTEECEINGYVIPEGAL
                     ILFNVWQVGRDPKYWDRPSEFRPERFLETGAEGEAGPLDLRGQHFQLLPFGSGRRMCP
                     GVNLATSGMATLLASLIQCFDLQVLGPQGQILKGGDAKVSMEERAGLTVPRAHSLVCV
                     PLARIGVASKLLS"



BASE COUNT      517 a    455 c    440 g    490 t
ORIGIN








Alignment Scores:
Pred. No.:              1.06e-251     Length:        1902
Score:                  2201.50       Matches:       451
Percent Similarity:     79.72%        Conservative:  1
Best Local Similarity:  79.54%        Mismatches:    69
Query Match:            91.88%        Indels:        46
DB:                     8             Gaps:          1



US-09-857-581-66 (1-521) x AF195819 (1-1902) 



Qy    1 MetLeuLeuGluLeuAlaLeuGlyLeu***ValLeuAlaLeuPhe***HisLeuArgPro 20
        |||||||||||||||||||||||||||   |||||||||||||||   ||||||||||||
Db   52 ATGTTGCTTGAACTTGCACTTGGTTTATTGGTTTTGGCTCTGTTTCTGCACTTGCGTCCC 111

Qy   21 ThrPro***Ala***SerLysAlaLeuArgHisLeuProAsnProProSerPro***Pro 40
        ||||||   |||   |||||||||||||||||||||||||||||||||||||||   |||
Db  112 ACACCCACTGCAAAATCAAAAGCACTTCGCCATCTCCCAAACCCACCAAGCCCAAAGCCT 171
Qy 41 ArgLeuProPheIleGlyHis***HisLeuLeuLysAspLysLeuLeuHisTyrAla*** 60

1 1 1 1 1 1 1 1 1 1 1 1 M I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 172 CGTCTTCCCTTCATAGGACACCTTCATCTCTTAAAAGACAAACTTCTCCACTACGCACTC 231 
Qy 61 IleAspLeuSerLysLysHisGlyProLeuPheSer******PheGlySerMetProThr 80 

IIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIII llllllllllllllllll 

Db 232 ATCGACCTCTCCAAAAAACATGGTCCCTTATTCTCTCTCTACTTTGGCTCCATGCCAACC 291 
Qy 81 ValValAlaSerThrProGluLeuPheLysLeuPheLeuGln******GluAlaThrSer 100 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII IIIIIIIMIII 

Db 2 92 GTTGTTGCCTCCACACCAGAATTGTTCAAGCTCTTCCTCCAAACGCACGAGGCAACTTCC 351 
Qy 101 Phe***ThrArgPheGlnThrSerAla***Arg***LeuThrTyrAsp******ValAla 120 

III lllllllllllllllllllll III llllllllllll MINI 

Db 3 52 TTCAACACAAGGTTCCAAACCTCAGCCATAAGACGCCTCACCTATGATAGCTCAGTGGCC 411 
Qy 121 ******Pro***GlyProTyrTrp***PheValArgLysLeuIleMetAsnAspLeuLeu 140 

III llllllllllll IIIMIIIIIIIIIIIIIIIIIIIIIIIII 

Db 412 ATGGTTCCCTTCGGACCTTACTGGAAGTTCGTGAGGAAGCTCATCATGAACGACCTTCCC 471 

Qy 141 AsnAlaThrThrValAsn***LeuArgProLeuArgThrGlnGlnIleArgLys***Leu 160 

llllllllllllllllll I I I I I I I I I I I I I I I I I I I I I I I I llllll III 
Db 472 AACGCCACCACTGTAAACAAGTTGAGGCCTTTGAGGACCCAACAGACCCGCAAGTTCCTT 531 

Qy 161 Arg***MetAlaGln***AlaGluAla***LysProLeuAsp***ThrGluGluLeuLeu 180

III mil Ml I I II I II I IJ llllllllllll III II Mill I MM 

Db 532 AGGGTTATGGCCCAAGGCGCAGAGGCACAGAAGCCCCTTGACTTGACCGAGGAGCTTCTG 591 

Qy 181 LysTrp***AsnSerThr***SerMetMet***LeuGlyGluAlaGluGluIleArgAsp 200 

llllll lllllllll lllllllll lllllllllllllllllllllllllll 
Db 5 92 AAATGGACCAACAGCACCATCTCCATGATGATGCTCGGCGAGGCTGAGGAGATCAGAGAC 651 

Qy 201 IleAlaArgGluValLeuLysIle***GlyGluTyrSerLeuThrAspPheIle***Pro 220 

IIIIIIMIIIIIIIIIIIIIIII III II Mill III III II IM II MM Ml 

Db 652 ATCGCTCGCGAGGTTCTTAAGATCTTTGGCGAATACAGCCTCACTGACTTCATCTGGCCA 711 
Qy 221 LeuLys***LeuLysValGlyLysTyrGluLysArgIleAspAspIleLeuAsnLysPhe 240

MUM IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIII 

Db 712 TTGAAGCATCTCAAGGTTGGAAAGTATGAGAAGAGGATCGACGACATCTTGAACAAGTTC 771
Qy 241 AspProValValGluArgValIleLysLysArgArg***IleValArgArgArg***Asn 260 

II II II 1 1 1 II I M II II II II I II 1 1 II 1 1 1 1 1 II 1 1 1 II I II II I II II III 

Db 772 GACCCTGTCGTTGAAAGGGTCATCAAGAAGCGCCGTGAGATCGTGAGGAGGAGAAAGAAC 831 

Qy 261 GlyGlu******GluGlyGlu***SerGlyVal***LeuAspThrLeuLeuGluPheAla 280 

llllll lllllllll lllllllll llllllllllllllllllllllll 

Db 832 GGAGAGGTTGTTGAGGGTGAGGTCAGCGGGGTTTTCCTTGACACTTTGCTTGAATTCGCT 8 91 

Qy 281 GluAspGluThr***GluIleLysIleThrLys******IleLysGlyLeuValVal 299 

IIIIIMIIIII IIIIIIIIIIIIIIMII llh::|IMIIIIIIII 
Db 892 GAGGATGAGACCATGGAGATCAAAATCACCAAGGACCACATCGAGGGTCTTGTTGTCGTG 951 

Qy 299 299 

Db 952 AGTTTCCTGCTTCATTCATTGATCGAAATATGCAGTATTTTGTTAACAAGAGATCGAGAA .1011 



Qy 



299 



299 



Db 1012 TTGACATTTATATATTCATGTGGTGGCAATTAATTAACGGTACGCATTCTTAATCGATAT 1071 
Qy 300 Asp***PheSerAlaGly***AspSerThrAla******ThrGluTr 315 

III 1 1 1 1 1 1 1 1 1 II I 1 1 1 II 1 1 1 II 1 1 I II 1 1 1 1 r 

Db 1072 TGTGTATGTGCAGGACTTTTTCTCGGCAGGAACAGACTCCACAGCGGTGGCAACAGAGTG 1131 
Qy 315 pAlaLeuAlaGluLeuIleAsnAsnPro***ValLeu******AlaArgGluGlu***Ty 335

IMIIIIIIIIIIIIIIIIIIIIIIIII IMIII llllllllllll II 

Db 1132 GGCATTGGCAGAACTCATCAACAATCCTAAGGTGTTGGAAAAGGCTCGTGAGGAGGTCTA 1191 
Qy 335 rSerValValGlyLysAsp***LeuValAspGluValAspThrGlnAsnLeuProTyrIl 355 

IIMIIIIIIIMIIIIII IMIMIIIIIIIIIIIIIIIIIIMIIIIIIIIIIII 

Db . 1192 CAGTGTTGTGGGAAAGGACAGACTTGTGGACGAAGTTGACACTCAAAACCTTCCTTACAT 1251 
Qy 355 eArgAlalleValLysGluThrPheArgMetHisProProLeuProValValLysArgLy 375 

iililillllllillllllllillliiiiiiiiiiiiiiiiiiilliiiiiiiiiiliii 

Db 1252 TAGAGCAATCGTGAAGGAGACATTCCGCATGCACCCGCCACTCCCAGTGGTCAAAAGAAA 1311 
Qy 375 sCys***GluGluCys***IleAsnGly***Val***ProGluGlyAlaLeu******Ph 395 

MM MMMMI MMMMI Ml lllllllllllllll II 

Db 1312 GTGCACAGAAGAGTGTGAGATTAATGGATATGTGATCCCAGAGGGAGCATTGATTCTCTT 1371 
Qy 395 eAsnValTrpGlnValGly***Asp***LysTyrTrpAspArgProSerGlu***ArgPr 415 

II II III II MM mil I Ml I II IIIIMIIIIII IMIIMII Mill 

Db 1372 CAATGTATGGCAAGTAGGAAGAGACCCCAAATACTGGGACAGACCATCGGAGTTCCGTCC 1431 
Qy 415 oGluArgPheLeuGluThr***AlaGluGlyGluAla******LeuAspLeuArgGly** 435 

IIIIIIIIIIIIIIMMI MIMIIMIIIIII IIIIIIIIIMIIII 

Db 1432 TGAGAGGTTCCTAGAGACAGGGGCTGAAGGGGAAGCAGGGCCTCTTGATCTTAGGGGACA 1491 
Qy 435 *HisPheGlnLeuLeuProPheGlySerGlyArg***MetCysProGlyVal***LeuAl 455 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIII lllllllllllllll Mill 

Db 1492 ACATTTTCAACTTCTCCCATTTGGGTCTGGGAGGAGAATGTGCCCTGGAGTCAATCTGGC 1551 
Qy 455 aThrSerGly***AlaThrLeuLeuAlaSerLeuIleGlnCysPheAspLeuGlnValLe 475 

MMMMI! Illllllllllllllllllllllllllllllllllllllllllllll 

Db 1552 TACTTCGGGAATGGCAACACTTCTTGCATCTCTTATTCAGTGCTTCGACTTGCAAGTGCT 1611 
Qy 475 uGlyProGlnGlyGlnIleLeuLysGly***AspAlaLysValSerMetGluGluArgAl 495 

llllllllilllliiiiiiiiillllll ilillllllliiliillllliiliiiiii 

Db 1612 GGGTCCACAAGGACAGATATTGAAGGGTGGTGACGCCAAAGTTAGCATGGAAGAGAGAGC 1671 
Qy 495 aGlyLeuThrValProArgAlaHisSerLeuValCysValProLeuAlaArgIleGlyVa 515 

lllllllllllllll IMIII llllllllllllllllllllll MM Ml llllllllll 

Db 1672 CGGCCTCACTGTTCCAAGGGCACATAGTCTTGTCTGTGTTCCACTTGCAAGGATCGGCGT 1731 
Qy 515 lAlaSerLysLeuLeuSer 521 

IIIMIIMMIIMIIII 

Db 1732 TGCATCTAAACTCCTTTCT 1750 

RESULT 13 
AF195818 

LOCUS AF195818 1800 bp DNA linear PLN 23-MAR-: 

DEFINITION Glycine max isoflavone synthase 1 (ifs1) gene, partial cds. 



ACCESSION AF195818 

VERSION AF195818.1 GI:7288452 

KEYWORDS 

SOURCE Glycine max. 

ORGANISM Glycine max 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; 
Rosidae; eurosids I; Fabales; Fabaceae; Papilionoideae; Phaseoleae; 
Glycine . 

REFERENCE 1 (bases 1 to 1800) 

AUTHORS Jung,W., Yu,O., Lau,S.M., O'Keefe,D.P., Odell,J., Fader,G. and 
McGonigle,B. 

TITLE Identification and expression of isoflavone synthase, the key 

enzyme for biosynthesis of isoflavones in legumes 
JOURNAL Nat. Biotechnol. 18 (2), 208-212 (2000) 
MEDLINE 20124255 
PUBMED 10657130 
REFERENCE 2 (bases 1 to 1800) 

AUTHORS Jung,W., Yu,O., Odell,J., Fader,G. and McGonigle,B. 
TITLE Direct Submission 

JOURNAL Submitted (18-OCT-1999) Nutrition and Health, DuPont, PO Box 80402, 
Wilmington, DE 19880-0402, USA 
FEATURES Location/Qualifiers 
source 1..1800 
/organism="Glycine max" 
/db_xref="taxon:3847" 
gene <1..>1800 
/gene="ifs1" 
mRNA join(<1..893,1112..>1800) 
/gene="ifs1" 
/product="isoflavone synthase 1" 
CDS join(<1..893,1112..1780) 
/gene="ifs1" 
/note="cytochrome P450" 
/codon_start=3 
/product="isoflavone synthase 1" 
/protein_id="AAF45142.1" 
/db_xref="GI:7288453" 

/translation="LELALGLFVLALFLHLRPTPSAKSKALRHLPNPPSPKPRLPFIG 
HLHLLKDKLLHYALIDLSKKHGPLFSLSFGSMPTVVASTPELFKLFLQTHEATSFNTR 
FQTSAIRRLTYDNSVAMVPFGPYWKFVRKLIMNDLLNATTVNKLRPLRTQQIRKFLRV 
MAQSAEAQKPLDVTEELLKWTNSTISMMMLGEAEEIRDIAREVLKIFGEYSLTDFIWP 
LKYLKVGKYEKRIDDILNKFDPVVERVIKKRREIVRRRKNGEVVEGEASGVFLDTLLE 
FAEDETMEIKITKEQIKGLVVDFFSAGTDSTAVATEWALAELINNPRVLQKAREEVYS 
VVGKDRLVDEVDTQNLPYIRAIVKETFRMHPPLPVVKRKCTEECEINGYVIPEGALVL 
FNVWQVGRDPKYWDRPSEFRPERFLETGAEGEAGPLDLRGQHFQLLPFGSGRRMCPGV 
NLATSGMATLLASLIQCFDLQVLGPQGQILKGDDAKVSMEERAGLTVPRAHSLVCVPL 
ARIGVASKLLS" 



BASE COUNT 476 a 441 c 
Pred. No.: 3.04e-250 
Length: 1800 
Score: 
Matches: 
76.18% 
DB: 8 Gaps: 1 

US-09-857-581-66 (1-521) x AF195818 (1-1800) 

Qy 3 LeuGluLeuAlaLeuGlyLeu***ValLeuAlaLeuPhe***HisLeuArgProThrPro 22 

IIIIIIIIIIIIIIIIIIIII Illllllllllllll IIIMIIIIIIIMIIII 

Db 3 CTGGAACTTGCACTTGGTTTGTTTGTGTTAGCTTTGTTTCTGCACTTGCGTCCCACACCA 62 

Qy 23 ***Ala***SerLysAlaLeuArgHisLeuProAsnProProSerPro***ProArgLeu 42 

III MIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII IMMIIII 

Db 63 AGTGCAAAATCAAAAGCACTTCGCCACCTCCCAAACCCTCCAAGCCCAAAGCCTCGTCTT 122 

Qy 43 ProPheIleGlyHis***HisLeuLeuLysAspLysLeuLeuHisTyrAla***IleAsp 62 

IIIIIMIIIIIIM lllllllllllllllllllllllllllllllll llllll 

Db 123 CCCTTCATTGGCCACCTTCACCTCTTAAAAGATAAACTTCTCCACTATGCACTCATCGAT 182 
Qy 63 LeuSerLysLysHisGlyProLeuPheSer******PheGlySerMetProThrValVal 82 

IIIIIIIMIIIIIIIIIIIIIIIIIIIII IIMIIIIIIIIIIIMIIIIIII 

Db 183 CTCTCCAAAAAGCATGGCCCCTTATTCTCTCTCTCCTTCGGCTCCATGCCAACCGTCGTT 242 
Qy 83 AlaSerThrProGluLeuPheLysLeuPheLeuGln******GluAlaThrSerPhe*** 102 

IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIII Illllllllllllll 

Db 243 GCCTCCACCCCTGAGTTGTTCAAGCTCTTCCTCCAAACCCACGAGGCAACTTCCTTCAAC 302 
Qy 103 ThrArgPheGlnThrSerAla***Arg***LeuThrTyrAsp******ValAla****** 122 

IIIIIIIIIIIIIIIIIIIII III IIIIIIIMIM III Ml 

Db 303 ACAAGGTTCCAAACCTCTGCCATAAGACGCCTCACTTACGACAACTCTGTGGCCATGGTT 362 
Qy 123 Pro***GlyProTyrTrp***PheValArgLysLeuIleMetAsnAspLeuLeuAsnAla 142 

III 1 1 1 1 M 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 

Db 363 CCATTCGGACCTTACTGGAAGTTCGTGAGGAAGCTCATCATGAACGACCTTCTCAACGCC 422 
Qy 143 ThrThrValAsn***LeuArgProLeuArgThrGlnGlnIleArgLys***LeuArg*** 162 

llllllllllll II III II II IIIIIIIIIIIIIIIIIIIIIIM llllll 

Db 423 ACCACCGTCAACAAGCTCAGGCCTTTGAGGACCCAACAGATCCGCAAGTTCCTTAGGGTT 482 
Qy 163 MetAlaGln***AlaGluAla***LysProLeuAsp***ThrGluGluLeuLeuLysTrp 182 

MMMII I IIIIIIIM MM Mill 1 11 II I II 1 1 M 1 1 II I II 1 1 1 1 1 

Db 483 ATGGCCCAAAGCGCAGAGGCCCAGAAGCCCCTTGACGTCACCGAGGAGCTTCTCAAATGG 542 
Qy 183 ***AsnSerThr***SerMetMet***LeuGlyGluAlaGluGluIleArgAspIleAla 202 

IIIIIMII lllllllll IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 543 ACCAACAGCACCATCTCCATGATGATGCTCGGCGAGGCTGAGGAGATCAGAGACATCGCT 602 
Qy 203 ArgGluValLeuLysIle***GlyGluTyrSerLeuThrAspPheIle***ProLeuLys 222 

iiiiiiiiiiiiiiiiii iiliiiiiiiiiiiiiiiiiiiiiiii lllllllll 

Db 603 CGCGAGGTTCTTAAGATCTTCGGCGAATACAGCCTCACTGACTTCATCTGGCCTTTGAAG 662 
Qy 223 ***LeuLysValGlyLysTyrGluLysArgIleAspAspIleLeuAsnLysPheAspPro 242 

llllllliillllllllllllllllllllillllllllllllllllliillllllll 

Db 663 TATCTCAAGGTTGGAAAGTATGAGAAGAGGATTGATGACATCTTGAACAAGTTCGACCCT 722 
Qy 243 ValValGluArgValIleLysLysArgArg***IleValArgArgArg***AsnGlyGlu 262 

IIIIMIIIIIIIIIIIIIIIIIIIIIIII IIIIIIIIIMIIII lllllllll 

Db 723 GTCGTTGAAAGGGTCATCAAGAAGCGCCGTGAGATCGTCAGAAGGAGAAAGAACGGAGAA 782 



Qy 263 ******GluGlyGlu***SerGlyVal***LeuAspThrLeuLeuGluPheAlaGluAsp 282 

lllllllll IIIIMIII lllllllllllllllllllllllllllll 



GluThr***GluIleLysIleThrLys******IleLysGlyLeuValVal 299 

mill iiiiiiiiiiiiiiiiii lllllllllllllll 



299 



299 



299 



-Asp***PheSerAlaGly***AspSerThr 309 

III llllllllllll lllllllll 



III III I Mill IIIIIIIIIIIIIIIIII III III III mill 

GCGGTGGCAACAGAGTGGGCATTGGCAGAGCTCATCAACAATCCCAGGGTGTTGCAAAAG 
AlaArgGluGlu***TyrSerValValGlyLysAsp***LeuValAspGluValAspThr 

llllllllllll II Mill Mill lllllllll llllllllllll II llllll'l 

GCTCGTGAGGAGGTCTACAGTGTTGTGGGCAAAGATAGACTCGTTGACGAAGTTGACACT 
GlnAsnLeuProTyrIleArgAlaIleValLysGluThrPheArgMetHisProProLeu 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

CAAAACCTTCCTTACATTAGGGCCATTGTGAAGGAGACATTCCGAATGCACCCACCACTC 
ProValValLysArgLysCys***GluGluCys***IleAsnGly***Val***ProGlu 

iiiiiiiiiiiiiiiiiiiii lllllllll lllllllll III mill 

CCAGTGGTCAAAAGAAAGTGCACAGAAGAGTGTGAGATTAATGGGTATGTGATCCCAGAG 
GlyAlaLeu******PheAsnValTrpGlnValGly***Asp***LysTyrTrpAspArg 

IIMIIIII IIIIIIIIIIIMMIIIIII III lllllllllllllll 

GGAGCATTGGTTCTTTTCAATGTTTGGCAAGTAGGAAGGGACCCCAAATACTGGGACAGA 
ProSerGlu***ArgProGluArgPheLeuGluThr***AlaGluGlyGluAla****** 

lllllllll lllliilillllilllllilllll lllllllllllllll 

CCATCAGAATTCCGTCCCGAGAGGTTCTTAGAAACTGGTGCTGAAGGGGAAGCAGGGCCT 
LeuAspLeuArgGly***HisPheGlnLeuLeuProPheGlySerGlyArg***MetCys 

lllllllllllllll iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii mm 

CTTGATCTTAGGGGCCAGCATTTCCAACTCCTCCCATTTGGGTCTGGGAGGAGAATGTGC 
ProGlyVal***LeuAlaThrSerGly***AlaThrLeuLeuAlaSerLeuIleGlnCys 

lllllllll iiimiiiiiiiii llllllllllllllllllllllllllllll 

CCTGGTGTCAATTTGGCTACTTCAGGAATGGCAACACTTCTTGCATCTCTTATCCAATGC 
PheAspLeuGlnValLeuGlyProGlnGlyGlnIleLeuLysGly***AspAlaLysVal 

mm lllllllllllllll iiiiiiiiiiiiiiiiiiiiiiii iiiiiimiii 



Db 1622 TTTGACCTGCAAGTGCTGGGCCCTCAAGGACAAATATTGAAAGGTGATGATGCCAAAGTT 1681 



Qy 490 SerMetGluGluArgAlaGlyLeuThrValProArgAlaHisSerLeuValCysValPro 509 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 1682 AGCATGGAAGAGAGAGCTGGCCTCACAGTTCCAAGGGCACATAGTCTCGTTTGTGTTCCA 1741 
Qy 510 LeuAlaArgIleGlyValAlaSerLysLeuLeuSer 521 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMII 

Db 1742 CTTGCAAGGATCGGCGTTGCATCTAAACTCCTTTCT 1777 
AF195801 1501 bp mRNA linear PLN 16-FEB-2000 

Medicago sativa isoflavone synthase 2 (ifs2) mRNA, partial cds. 
AF195801 

AF195801.1 GI:6979525 

Medicago sativa. 
Medicago sativa 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; 
Rosidae; eurosids I; Fabales; Fabaceae; Papilionoideae; Trifolieae; 
Medicago. 

1 (bases 1 to 1501) 

Jung,W., Yu,O., Lau,S.M., O'Keefe,D.P., Odell,J., Fader,G. and 
McGonigle,B. 

Identification and expression of isoflavone synthase, the key 

enzyme for biosynthesis of isoflavones in legumes 

Nat. Biotechnol. 18 (2), 208-212 (2000) 

20124255 

10657130 

2 (bases 1 to 1501) 

Jung,W., Yu,O., Odell,J., Fader,G. and McGonigle,B. 
Direct Submission 

Submitted (18-OCT-1999) Nutrition and Health, DuPont, P.O. Box 
80402, Wilmington, DE 19880-0402, USA 

Location/Qualifiers 

1..1501 

/organism="Medicago sativa" 

/db_xref="taxon:3879" 

<1..>1501 

/gene="ifs2" 

<1..>1501 

/gene="ifs2" 

/codon_start=3 

/product="isoflavone synthase 2" 
/protein_id="AAF34522.1" 
/db_xref="GI:6979526" 

/translation="FLHLRPTPTAKSKALRHLPNPPSPKPRLPFIGHLHLLKDKLLHY 
ALIDLSKKHGPLFSLYFGSMPTVVASTPELFKLFLQTHEATSFNTRFQTSAIRRLTYD 
SSVAMAPFGPYWKFVRKLIMNDLLNATTVNKLRPLRTQQIRKFLRVMAQGAEAQKPLD 
LTEELLKWTNSTTSMMMLGEAEEIRDIAREVLKIFGEYSLTDFIRPLKHLKVGKYEKR 
IDDILNKFDPVVERVIKKRREIVRRRKNGEVVEGEVSGVFLDTLLEFAEDETTEIKIT 
KDHIKGLVVDFFSAGTDSTAVATEWALAELINNPKVLEKAREEVYSVVGKDRLVDEVD 
TQNLPYIRAIVKETFRMHPPLPVVKRKCTEECEINGYVIPEGALILFNVWQVGRDSKY 
WDRPSEFRPERFLETGAEGEARPLDLRGQHFQLLPFGSGRRMCPGVNLATSGMATLLA 
SLIQCFDLQVLGPQGQILKGGDAKVSMEERAGLTVPRAHSLVCVPLARIG" 



